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CAROTENOID KETOLASE GENES AND GENE PRODUCTS, 
PRODUCTION OF KETOCAROTENOIDS AND METHODS OF 
MODIFYING CAROTENOIDS USING THE GENES 

RAP.KfiROt IMP OF TH F INVENTION 
5 Carotenoids are widely distributed natural pigments that are responsible for 

many of the yellow, orange and red colors seen in living organisms. They have 
important commercial uses as coloring agents in the food industry, as feed and food 
additives, in cosmetics and as provitamin A precursors. 

The plant species Adonis aestivalis produces flowers with petals that are deep 
10 red in color and nearly black at the base of the petals due to the accumulation of 
ketocarotenoid and other carotenoid pigments (Neamtu et al., Rev. Roum. Biochim. 
6:157, 1969). This pattern of carotenoid accumulation accounts for the common name 
of some varieties of this species: summer pheasant's eye. 

Among the carotenoids identified in the petals of the red petal varieties of these 
15 various species is the ketocarotenoid astaxanthin (3,3'-dihydroxy-4,4 , -diketo-b,b- 
carotene; see Figure 1). Various other ketocarotenoids (see Figure 1) including 3- 
hydroxyechinenone ( 3-hydroxy-4-keto-b,b-carotene), adonirubin (3-hydroxy-4,4'-diketo- 
b.b-carotene) adonixanthin (3,3'-dihydroxy-4-keto-b,b-carotene) and isozeaxanthin 
(4,4'-dihydroxy-b,b-carotene; see T.W. Goodwin, The Biochemistry of the Carotenoids, 
20 vol I. Plants, 2nd edition, 1980, page 147) have also been reported. The latter 
compound is consistent with speculation that the 4-hydroxy may be an intermediate in 
the formation of the 4-keto group. 

SI IMMARY OF THF INVFNTION 
There is appreciable interest in the biological production of carotenoids, in 

25 particular the orange-colored ketocarotenoids such as astaxanthin and canthaxanthin 
(Figure 1), and in the modification of carotenoid composition. For this reason, an A. 
aestivalis flower cDNA library was constructed and screened for cDNAs encoding 
enzymes (hereinafter referred to as "ketolases" although the specific biochemical 
activity has not yet been established) involved in the conversion of b-carotene into 

30 orange compounds with absorption properties similar to those exhibited by common 
ketocarotenoids such as canthaxanthin (Figure 1). Two distinctly different Adonis 
aestivalis cDNAs were obtained from among a number of cDNAs that were selected on 
this basis. 
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Thus, a first aspect of the present invention is a purified nucleic acid sequence 
which encodes for a protein having ketolase enzyme activity and has the nucleic acid 
sequence of SEQ ID NO: 1 or 3. 

The invention also includes a purified nucleic acid sequence which encodes for 
5 a protein having ketolase enzyme activity and having the amino acid sequence of SEQ 
ID NO: 2 or 4. 

The invention also includes vectors which comprise any portion of the nucleic 
acid sequences listed above, and host cells transformed with such vectors. 

Another aspect of the present invention is a method of producing a 
10 ketocarotenoid in a host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having ketolase enzyme activity and comprises 
(1 ) SEQ ID NO: 1 or 3 or (2) a sequence which encodes the amino acid sequence of 
SEQ ID NO: 2 or 4, wherein the heterologous nucleic acid sequence is operably linked 
15 to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing 

the ketolase enzyme. 

Another subject of the present invention is a method of modifying the production 
of carotenoids in a host cell, relative to an untransformed host cell, the method 
20 comprising 

inserting into a host cell which already produces carotenoids a vector 
comprising a heterologous nucleic acid sequence which encodes for a protein having 
ketolase enzyme activity and comprises (1) SEQ ID NO: 1 or 3 or (2) a sequence which 
encodes the amino acid sequence of SEQ ID NO: 2 or 4, wherein the heterologous 
25 nucleic acid sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence in the host cell to 
modify the production of the carotenoids in the host cell, relative to an untransformed 
host cell. 

RRIFF DESCRIPTION OF THE DRAWINGS 
30 A more complete appreciation of the invention and many of the attendant 

advantages thereof will be readily obtained as the same becomes better understood by 
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reference to the following detailed description when considered in connection with the 
accompanying drawings. 

Figure 1 illustrates structures and biochemical routes leading from b-carotene to various 
of the ketocarotenoids referred to in the text. Conversion of p-carotene to astaxanthin 
5 by a hydroxylase enzyme (Hy) and a ketolase enzyme (keto) could proceed via any one 
or all of several possible routes depending on the order of the reactions. 

Figure 2 illustrates the beta ring structure of b-carotene and various modifications of this 
parent ring that might be produced through the action of the products of the A. aestivalis 
ketolase cDNAs. Also shown is the structure of the epsilon ring, not found to be a 
10 substrate for the A. aestivalis ketolases and present in carotenoids such as d-carotene, 
e-carotene, a-carotene and lutein. 

Figure 3 illustrate results obtained with TLC (thin layer chromatography) separation of 
carotenoid pigments extracted from E. coli cultures, previously engineered to produce 
b-carotene, but that now also contain the A. aestivalis ketolase cDNAs and/or other 

15 introduced genes and cDNAs. The Figure indicates the empty plasmid vector 
pBluescript SK- (SK-), the Adonis aestivalis ketolase 1 cDNA in this plasmid vector (Ad 
ketol), the Haematococcus pluvialis ketolase cDNA in this plasmid vector Hp keto), or 
the Arabidopsis p-carotene hydroxylase cDNA (At Ohase). Bands that were orange in 
color are shown here with a darker fill than those with a yellow color. Identities of 

20 various bands are indicated to the right of the band. 

Figure 4 illustrates the absorption spectrum of one of the orange carotenoids produced 
from b-carotene via the action of the Adonis ketolases and makes clear the similarity 
of the spectrum to that of canthaxanthin. Absorption spectra (in acetone) of p-carotene, 
canthaxanthin and an unknown orange product (orange band #1 ; the lower orange 
25 band in the first lane of Figure 3) extracted from cultures after introduction of the Adonis 
aestivalis ketol cDNA (SEQ ID NO: 1) in cells of E. coli that otherwise produce and 
accumulate p-carotene. The absorption spectrum of the unknown resembles that of 
canthaxanthin but the compound migrates to a position below echinenone on RP18 
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TLC plates developed with a mobile phase of methanol:acetone (1:1 by volume). The 
absorption spectrum of orange band #2 also is similar to that of canthaxanthin but it 
migrates more rapidly than canthaxanthin indicating that it is probably a more polar 
compound. 

5 Figure 5 shows SEQ ID NO: 5 (the sequence shown in this Figure includes SEQ ID NO: 

1 and also includes some of the flanking DNA from the adapator DNA and the multiple 
cloning site (MCS) of the library cloning vector, which sequences are shown in bold). 

Figure 6 shows SEQ ID NO: 6 (the sequence shown in this Figure includes SEQ ID NO: 

2 and also includes a translation of amino acids resulting from the adapator DNA and 
1 0 the multiple cloning site (MCS) of the library cloning vector and the start codon from the 

plasmid vector pTrcHis, which sequences are shown in bold and capitalized). 

Figure 7 shows SEQ ID NO: 7 (the sequence shown in this Figure includes SEQ ID NO: 

3 and also includes some of the flanking DNA from the adapator DNA and the multiple 
cloning site (MCS) of the library cloning vector, which sequences are shown in bold). 

1 5 Figure 8 shows SEQ ID NO: 8 (the sequence shown in this Figure includes SEQ ID NO: 

4 and also includes a translation of amino acids resulting from the adapator DNA and 
the multiple cloning site (MCS) of the library cloning vector and the start codon from the 
plasmid vector, which sequences are shown in bold and capitalized). 

Figure 9 shows a "Gap" alignment of the two Adonis ketolase sequences of the 
20 invention. A truncated version of SEQ ID NO: 1 is shown in this Figure for comparitive 
purposes, and is designated SEQ ID NO: 9. The percentage identity was calculated 
to be 91.107. 

Figure 10 shows a "Gap" alignment of SEQ ID NO: 2 and 4. The following results were 
found: 

25 Gap weight: 12 average match: 2.912 



Length weight: 4 



average mismatch: -2.003 
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Quality: 1440 length: 307 

Ratio: 4.691 gaps: 0 

percent similarity: 92.182 percent identity: 90.228 

Figure 1 1 shows a comparison between SEQ ID NO: 2 and the Arabidopsis thaliana (3- 
5 carotene hydroxylase enzyme (GenBank U58919) (SEQ ID NO: 10). 

Figure 12A shows gDNA (SEQ ID NO: 11) immediately upstream of the cDNA of SEQ 
ID NO: 3. The sequence was obtained from a PCR product generated using the 
GenomeWalker kit of Clontech Laboratories, Inc. (1020 East Meadow Circle, Palo Alto, 
CA 94303-4230) and nested primers specific to the ketolases of Adonis aestivalis 
10 (cagaatcggtctgttctattagttcttcc (SEQ ID NO: 17) and caatttgaggaatatcaaggttccttgttctc 
(SEQ ID NO: 18)). The termination codon upstream of and in-frame with initiation 
codon (TAA at positions 204-206) is shown in bold. Initiation codon (ATG) is also 
shown in bold. 

Figure 12B (SEQ ID NO: 12) indicates that the full length polypeptide of SEQ ID NO: 
15 4 begins with the amino acids MAA (shown in bold) immediately preceding the ketolase 
sequence shown in Figure 8. A similar MAA amino acid sequence immediately 
preceding SEQ ID NO: 1 is also expected. 

Figure 13 shows an alignment of SEQ ID NO: 2, SEQ ID NO: 12, an Arabidopsis p- 
20 carotene hydroxylase enzyme (predicted product of GenBank U58919) (SEQ ID NO: 
13), a putative second Arabidopsis hydroxylase predicted by genomic DNA sequence 
(GenBank AB025606; the exon/intron junctions were chosen with reference to the 
product of the Arabidopsis p-carotene hydroxylase cDNA u58919) (SEQ ID NO: 14), 
and two Capsicum annuurn p-carotene hydroxylases (predicted products of GenBank 
25 Y09722 and Y09225) (SEQ ID NO: 1 5 and 1 6). 

INSCRIPTION OF THF PREFERRED EMBODIMENTS 
The present invention is directed to a purified nucleic acid sequence which 
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encodes for a protein having ketolase enzyme activity and has the nucleic acid 
sequence of SEQ ID NO: 1 or 3. 

The invention also includes a purified nucleic acid sequence which encodes for 
a protein having ketolase enzyme activity and having the amino acid sequence of SEQ 
5 ID NO: 2 or 4. 

Two different but closely-related nucleic acids have been isolated. The 
sequences of the longest example of each are presented herein. Sequencing which 
has subsequently been conducted of upstream genomic DNA indicates that SEQ ID 
NO: 3 lacks bases encoding the first three amino acids (MAA; see Figure 12). Likely, 

1 0 this is also the case for SEQ ID NO: 1 , but the upstream genomic sequences have not 
yet been obtained for this nucleic acid. 

The two different Adonis ketolases denoted in SEQ ID NO: 1 and 3 are similar 
in sequence, sharing about 91 % identity, as determined by the Gap program discussed 
below (see Figure 9). The predicted amino acid sequences of the enzymes denoted in 

15 SEQ ID NO: 2 and 4 share about 92% similarity and about 90% identity, also as 
determined by the Gap program (see Figure 10). 

Therefore, it is clear that certain modifications of SEQ ID NO: 1 or 3 or SEQ ID 
NO: 2 or 4 can take place without destroying the activity of the enzyme. Note also that 
certain truncated versions of the cDNAs of SEQ ID NO: 1 or 3 were found to be 

20 functional (i.e., these cDNAs retained the property of causing the conversion of b~ 
carotene to orange compounds). Also, the Arabidopsis p-carotene hydroxylase 
(GenBank U58919), aligned with the ketolase SEQ ID NO: 2 in Figure 11, retains 
catalytic function when truncated to yield a polypeptide that lacks the first 129 amino 
acids (Sun et al., 1996). From the alignment in Figure 1 1 , therefore, this would suggest 

25 that the two ketolases of the invention retain catalytic activity after truncation to remove 
bases encoding the first 132 amino acids. 

Thus, the present invention is intended to include those ketolase nucleic acid 
and amino acid sequences in which substitutions, deletions, additions or other 
modifications have taken place, as compared to SEQ ID NO: 1 or 3 or SEQ ID NO: 2 

30 or 4, without destroying the activity of the ketolase enzyme. Preferably, the 
substitutions, deletions, additions or other modifications take place at those positions 
which already show dissimilarity between the present sequences. For SEQ ID NO: 1 , 
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as shown in Figure 9, these positions are as follows: positions 7, 20, 23, 35, 53, 63, 65, 
67, 76, 78, 85, 86, 91, 107, 109-111, 135, 140, 144, 146, 160, 168, 217, 219, 241, 249, 
254, 256, 271, 291, 296, 349, 389, 400, 406, 431, 448, 449, 460, 471, 499, 530, 589, 
619, 643, 653, 654, 667, 679, 709, 731, 742, 784, 787, 836, 871, 883, 896, 911, 919, 
5 928, 930, 939, 943, 967, 969, 978, 979, 982, 988, 995, 1005, 1006, 1012-1014, 1017, 
1019-1021. 1023, 1025, 1049, 1050, 1054, 1060-1068, 1070-1073, 1075, 1094, 1100, 
1101, 1106, 1107, 1109 and 1111-1176. For SEQ ID NO: 3, as shown in Figure 9, 
these positions are as follows: positions 7, 20, 23, 35, 53, 63, 65, 67, 76, 78, 85, 86, 91, 
107, 109-111, 135, 140, 144, 146, 160, 168, 217, 219, 241,249, 254, 256, 271,291, 
1 0 296, 349, 389, 400, 406, 431 , 448, 449, 460, 471 , 499, 530, 589, 61 9. 643, 653, 654, 
667, 679, 709. 731, 742, 784, 787, 836, 871, 883, 896, 911, 919, 928, 930, 939, 943, 
966. 967. 970. 979. 980. 983, 989, 996, 1006, 1007, 1013-1015, 1018, 1020-1022, 
1024, 1026, 1050, 1051, 1055, 1062-1065, 1067, 1086, 1092, 1093, 1098, 1099, 1101 
and 1103-1112. 

15 For SEQ ID NO: 2 and 4, as shown in Figure 10, the following amino acids can 

be substituted or deleted, or additions or other modifications can be made, without 
destroying the activity of the ketolase enzyme: positions 7, 8, 12, 18, 21 , 22, 25, 26, 36, 
37, 45, 47-49, 56, 73, 83, 85, 97, 99, 130, 144, 150, 157, 166, 218, 244, 279, 299 and 
304. Therefore, the present invention also intends to cover amino acid sequences 

20 where such changes have been made. 

In each case, nucleic acid and amino acid sequence similarity and identity is 
measured using sequence analysis software, for example, the Sequence Analysis, Gap, 
or BestFit software packages of the Genetics Computer Group (University of Wisconsin 
Biotechnology Center, 1710 University Avenue, Madison, Wisconsin 53705), MEGAIign 

25 (DNAStar, Inc., 1228 S. Park St., Madison, Wisconsin 53715), or MacVector (Oxford 
Molecular Group, 2105 S. Bascom Avenue, Suite 200, Campbell, California 95008). 
Such software uses algorithms to match similar sequences by assigning degrees of 
identity to various substitutions, deletions, and other modifications, and includes 
detailed instructions as to useful parameters, etc., such that those of routine skill in the 

30 art can easily compare sequence similarities and identities. An example of a useful 
algorithm in this regard is the algorithm of Needleman and Wunsch, which is used in the 
Gap program discussed above. This program finds the alignment of two complete 
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sequences that maximizes the number of matches and minimizes the number of gaps. 
Another useful algorithm is the algorithm of Smith and Waterman, which is used in the 
BestFit program discussed above. This program creates an optimal alignment of the 
best segment of similarity between two sequences. Optimal alignments are found by 
5 inserting gaps to maximize the number of matches using the local homology algorithm 
of Smith and Waterman. 

Conservative (i.e. similar) substitutions typically include substitutions within the 
following groups: glycine and alanine; valine, isoleucine and leucine; aspartic acid, 
glutamic acid, asparagine and glutamine; serine and threonine; lysine and arginine; and 

10 phenylalanine and tyrosine. Substitutions may also be made on the basis of conserved 
hydrophobicity or hydrophilicity (see Kyte and Doolittle, J. Mol. Biol. 157: 105-132 
(1982)), or on the basis of the ability to assume similar polypeptide secondary structure 
(see Chou and Fasman, Adv. EnzymoL 47: 45-148 (1978)). 

If comparison is made between nucleotide sequences, preferably the length of 

15 comparison sequences is at least 50 nucleotides, more preferably at least 60 
nucleotides, at least 75 nucleotides or at least 100 nucleotides. It is most preferred if 
comparison is made between the nucleic acid sequences encoding the enzyme coding 
regions necessary for enzyme activity. If comparison is made between amino acid 
sequences, preferably the length of comparison is at least 20 amino acids, more 

20 preferably at least 30 amino acids, at least 40 amino acids or at least 50 amino acids. 
It is most preferred if comparison is made between the amino acid sequences in the 
enzyme coding regions necessary for enzyme activity. 

While the two different Adonis ketolase enzymes of the present invention are 
similar in sequence, previously-described bacterial (Misawa etal., 1995), cyanobacterial 

25 (Fernandez-Gonzalez et ah, 1997), and green algal (Haematococcus pluvialis; Lotan et 
aL, 1995; Kajiwara et al., 1995) (3-carotene ketolase enzymes bear little resemblance 
to the Adonis ketolases, although certain histidine motifs and features of the predicted 
secondary structure are common to the polypeptides predicted by both groups 
(Cunningham and Gantt, 1998). 

30 The present invention also includes vectors containing the nucleic acids of the 

invention. Suitable vectors according to the present invention comprise a gene 
encoding a ketolase enzyme as described above, wherein the gene is operably linked 
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to a suitable promoter. Suitable promoters for the vector can be constructed using 
techniques well known in the art (see, for example, Sambrook et a!., Molecular Cloning 
A l ahoratory Manual . Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1989; 
Ausubel et al., Current Protocols in Molecular Biology . Greene Publishing and Wiley 
5 Interscience, New York, 1 991 ). Suitable vectors for eukaryotic expression in plants are 
described in Fray et al., (1995; Plant J. 8:693-701) and Misawa et al, (1994; Plant J. 
6:481-489). Suitable vectors for prokaryotic expression include pACYC184, pUC1 19, 
and pBR322 (available from New England BioLabs, Bevery, MA) and pTrcHis 
(Invitrogen) and pET28 (Novagen) and derivatives thereof. The vectors of the present 

10 invention can additionally contain regulatory elements such as promoters, repressors, 
selectable markers such as antibiotic resistance genes, etc., the construction of which 
is very well known in the art. 

The genes encoding the ketolase enzymes as described above, when cloned 
into a suitable expression vector, can be used to overexpress these enzymes in a host 

1 5 cell expression system or to inhibit the expression of these enzymes. For example, a 
vector containing a gene of the invention may be used to increase the amount of 
ketocarotenoids in an organism and thereby alter the nutritional or commercial value or 
pharmacology of the organism. A vector containing a gene of the invention may also 
be used to modify the carotenoid production in an organism. 

20 Therefore, the present invention includes a method of producing a 

ketocarotenoid in a host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having ketolase enzyme activity and comprises 
(1) SEQ ID NO: 1 or 3 or (2) a sequence which encodes the amino acid sequence of 

25 SEQ ID NO: 2 or 4, wherein the heterologous nucleic acid sequence is operably linked 
to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing 

the ketocarotenoid. 

The present invention also includes a method of modifying the production of 
30 carotenoids in a host cell, relative to an untransformed host cell, the method comprising 
inserting into a host cell which already produces carotenoids a vector 
comprising a heterologous nucleic acid sequence which encodes for a protein having 
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ketolase enzyme activity and comprises (1) SEQ ID NO: 1 or 3 or (2) a sequence which 
encodes the amino acid sequence of SEQ ID NO: 2 or 4, wherein the heterologous 
nucleic acid sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence in the host cell to 
5 modify the production of the carotenoids in the host cell, relative to an untransformed 
host cell. 

The term "modifying the production" means that the amount of carotenoids 
produced can be enhanced, reduced, or left the same, as compared to an 
untransformed host cell. In accordance with one embodiment of the present invention, 

10 the make-up of the carotenoids (i.e., the type of carotenoids produced) is changed vis 
a vis each other, and this change in make-up may result in either a net gain, net loss, 
or no net change in the amount of carotenoids produced in the cell. In accordance with 
another embodiment of the present invention, the production or the biochemical activity 
of the carotenoids (or the enzymes which catalyze their formation) is enhanced by the 

1 5 insertion of the ketolase enzyme-encoding nucleic acid. In yet another embodiment of 
the invention, the production or the biochemical activity of the carotenoids (or the 
enzymes which catalyze their formation) may be reduced or inhibited by a number of 
different approaches available to those skilled in the art, including but not limited to 
such methodologies or approaches as anti-sense (e.g., Gray et al..(1992), Plant Mol. 

20 Biol. 19:69-87), ribozymes (e.g., Wegener et al (1994) Mol. Gen. Genet 1994 Nov 
15;245(4):465-470), co-suppression (e.g. Fray et al. (1993) Plant Mol. Biol. 
22:589-602), targeted disruption of the gene (e.g., Schaefer et al. Plant J. 
11:11 95-1 206, 1 997), intracellular antibodies (e.g., see Rondon et al. (1 997) Annu. Rev. 
Microbiol. 51:257-283) or whatever other approaches rely on the knowledge or 

25 availability of the nucleic acid sequences of the invention, or the enzymes encoded 
thereby. 

Host systems according to the present invention preferably comprise any 
organism which is capable of producing carotenoids, or which already produces 
carotenoids. Such organisms include plants, algae, certain bacteria, cyanobacteria and 
30 other photosynthetic bacteria. Transformation of these hosts with vectors according to 
the present invention can be done using standard techniques. See, for example, 
Sambrook et al., Modular Cloning A Laborat ory Manual, Cold Spring Harbor 
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Laboratory, Cold Spring Harbor, NY, 1989; Ausubel et aI. B Current Protocols in 

MpifiH"lar Biology. Greene Publishing and Wiley Interscience, New York, 1991. 

Alternatively, transgenic organisms can be constructed which include the nucleic 

acid sequences of the present invention. The incorporation of these sequences can 
5 allow the controlling of carotenoid biosynthesis, content, or composition in the host cell. 

These transgenic systems can be constructed to incorporate sequences which allow 

for the overexpression of the various nucleic acid sequences of the present invention. 

Transgenic systems can also be constructed which allow for the underexpression of the 

various nucleic acid sequences of the present invention. Such systems may contain 
10 anti-sense expression of the nucleic acid sequences of the present invention. Such 

anti-sense expression would result in the accumulation of the substrates of the enzyme 

encoded by the sense strand. 

Having generally described this invention, a further understanding can be 

obtained by reference to certain specific examples which are provided herein for 
15 purposes of illustration only and are not intended to be limiting unless otherwise 

specified. 

FXAMPLE 1 

Isolation of plant cDNAs that convert b -oarotene into compounds with ketocarotenoid- 

like spectra 

20 A flower cDNA library from the plant Adonis aestivalis was introduced into a 

strain of Escherichia coli engineered to accumulate the yellow carotenoid pigment p- 
carotene (see Cunningham et al., Plant Cell 8:1613-26, 1996). This strain of E. coli 
normally forms yellow colonies when cultures are spread on a solid agar growth 
medium. Ketocarotenoids that are derived from b-carotene, such as echinenone and 

25 canthaxanthin (Figure 1 ), are, in contrast, orange to orange-red in color. Colonies that 
were orange rather than yellow in color were visually selected, and the DNA sequences 
of the Adonis aestivalis cDNAs within the plasmid vectors contained in these colonies 
were ascertained. Two distinct cDNAs were obtained from analysis of cDNA inserts in 
plasmids obtained from approximately 10 selected colonies. The DNA sequences of 

30 these two ketolase cDNAs are presented herein. 

The products produced by the ketolases of the invention which have been 
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expressed in a (3-carotene-accumulating strain of Eschericia coli have not yet been 
identified. As many as 5 or 6 different colored bands, in addition to the substrate P- 
carotene, may readily be discerned by C 18 TLC separation (see Figure 3). To provide 
appropriate standards to assist in identification, an H. pluvialis ketolase and an 
5 Arabidopsis P-carotene hydroxylase were separately introduced into the p-carotene- 
accumulating E. coli to produce echinenone (3-keto-p, P-carotene) and canthaxanthin 
(3,3-diketo-P, P-carotene) or p-cryptoxanthin (4-hydroxy-P,P-carotene) and zeaxanthin 
(4 ) 4 i -dihydroxy-p,p-carotene). None of the compounds formed in the presence of the 
ketolases of the invention (no difference was observed in products formed in the 

10 presence of the two different nucleic acid sequences of the invention) both migrate in 
the TLC system and have the absorption spectrum expected for echinenone, 
canthaxanthin, P-cryptoxanthin, or zeaxanthin. Two of the colored TLC bands produced 
in the presence of the Adonis ketolase cDNAs are orange in color. Orange band #1 
has an absorption spectrum similar to that of canthaxanthin (see Figure 4) but migrates 

15 in a position that indicates a polarity intermediate to echinenone and p-carotene. 
Orange band #2 also has an absorption spectrum like that of canthaxanthin but 
migrates in a position that indicates a polarity intermediate to canthaxanthin and 
zeaxanthin (see Figure 3). The absorption spectra and TLC results suggest that the 
two orange products could be desaturated at the 3-4 positions of both rings (3,4,- 

20 didehydro; see Figure 2). Orange band #1 (see Figure 3) might then be 3,4,3\4'- 
tetradehydro-P,p-carotene. To substantially affect the absorption spectrum of the 
substrate p-carotene, any modifications very likely involve a carbon that lies in 
conjugation with the conjugated chain of carbon-carbon double bonds that constitute 
the chromophore (Goodwin, 1980; The Bioche mistry of the Carotenoids. volume I; 2 nd 

25 edition, Chapman and Hall). For the spectra obtained, only the carbons at the number 
4 position of the two rings appear to be plausible locations for modification. The 
multitude and TLC migrations of the yellow and orange products produced from the 
symmetrical P-carotene, however, also indicates that the enzymes of the invention carry 
out more than a single type of reaction. The apparent homology of the ketolases of the 

30 invention to the Arabidopsis P-carotene hydroxylase would suggest that compounds 
with a hydroxy! at the 3 and/or 4 positions of one or both rings are another possible 
outcome (see Figure 2). In fact, such compounds have been identified in Adonis (see 
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above), and it has long been conjectured that a hydroxyl at position 4 is an intermediate 
in the formation of the 4-keto (e.g. crustaxanthin, a 3,3\4,4' tetrahydroxy carotenoid that 
might be a precursor for astaxanthin in the exoskeleton of the lobster). The histidine 
motifs and secondary structure in common to the hydroxylase and ketolase enzymes 
5 are characteristics of a large group of di-iron oxygenases whose members also include 
examples of desaturases (J. Shanklin, 1998, Ann. Rev. Plant Physiol. Plant Mol. Biol.), 
therefore a 3-4 desaturation (and/or perhaps a 2-3 desaturation in one or more of the 
yellow compounds) would also seem a plausible outcome. 

To summarize the results of this example for the Adonis ketolases of the 

10 invention, a number of different carotenoids, including two with ketocarotenoid-like 
spectra, are produced from P-carotene via the action of the products of either of the two 
different nucleic acids of the invention. These orange compounds appear to be the 
major products. Truncation and fusion of the cDNAs to a stronger promoter in the 
vector pTrcHis (Invitrogen) was detrimental to growth of E. coli but did result in 

15 improved yield of the most polar orange product (orange band #2 in Figure 3). 
Introduction of a cyanobacterial ferredoxin did not change the yield or relative amounts 
of the various products. Without being bound by theory, it may be that the 
ketocarotenoids produced in flower petals of Adonis actually include the as yet 
unidentified orange compounds that are produced in E. coli using the nucleic acids of 

20 the invention. 

EXAMPLE 2 
Substrate s pecificity of the Adon is ketolases 

Carotenoids with e rings are common in plants. The s ring differs from the b ring 
only in the position of the double bond within the ring (Figure 2). The e ring is reported 

25 to be a poor substrate for the Arabidopsis b-carotene hydroxylase (Sun et al., 1996). 
The Adonis ketolase cDNAs were introduced into strains of E. coli engineered 
(Cunningham et al., 1996) to accumulate carotenoids with one or two e rings (d- 
carotene and e-carotene), or the acyclic carotenoid lycopene. TLC analysis of acetone 
extracts revealed that these carotenoids were not modified by the Adonis ketolases. 

30 as indicated by a lack of any new products formed. Products produced in E. coli 
engineered to accumulate zeaxanthin (Sun et al., 1996) appeared to be the same as 
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for p-carotene accumulating cultures indicating that a 3-OH is likely to be one of the 
functional groups introduced to the b ring by the Adonis ketolases. The more polar 
orange band produced from b-carotene through the action of the Adonis ketolases (e.g., 
orange band 2 in Figure 3), therefore, could very well be S.S'-dihydroxy-SAS',^- 
5 tetradehydro-b,b-carotene. 

The references cited in the application, along with the following references, are 
incorporated by reference: 

Bouvier F, et al. (1998) Xanthophyll biosynthesis: molecular and functional 
characterization of carotenoid hydroxylases from pepper fruits (Capsicum annuum L.). 
1 0 Biochim Biophys Acta. 1 391 :320-8 

Breitenbach J, et al. (1996) Expression in Escherichia coli and properties of the 
carotene ketolase from Haematococcus pluvialis. FEMS Microbiol Lett. 140:241-6 

Cunningham FX Jr, Gantt E (1998) Genes and enzymes of carotenoid biosynthesis in 
plants. Ann Rev Plant Physiol Plant Mol Biol 49: 557-583 

1 5 Fernandez-Gonzalez B, et al. (1 997) A new type of asymmetrically acting beta-carotene 
ketolase is required for the synthesis of echinenone in the cyanobacterium 
Synechocystis sp. PCC 6803. J Biol Chem. 272:9728-33 

Fraser PD, et al. (1997) In vitro characterization of astaxanthin biosynthetic enzymes. 
J Biol Chem. 1997272:6128-35 

20 Fraser PD, et al. (1998) Enzymic confirmation of reactions involved in routes to 
astaxanthin formation, elucidated using a direct substrate in vitro assay. Eur J Biochem. 
252:229-36 

Harker M, et al. (1997) Biosynthesis of ketocarotenoids in transgenic cyanobacteria 
expressing the algal gene for beta-C-4-oxygenase, crtO. FEBS Lett. 404:129-34 
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Kajiwara S, et al. (1995) Isolation and functional identification of a novel cDNA for 
astaxanthin biosynthesis from Haematococcus pluvialis, and astaxanthin synthesis in 
Escherichia coli. Plant Mol Biol. 29:343-52 

Lotan T, et al. (1995) Cloning and expression in Escherichia coli of the gene encoding 
5 beta-C-4-oxygenase, that converts beta-carotene to the ketocarotenoid canthaxanthin 
in Haematococcus pluvialis. FEBS Lett. 364:125-8 

Misawa N, et al. (1995) Canthaxanthin biosynthesis by the conversion of methylene to 
keto groups in a hydrocarbon beta-carotene by a single gene. Biochem Biophys Res 
10 Commun.209:867-76 

Misawa N, et al. (1995) Structure and functional analysis of a marine bacterial 
carotenoid biosynthesis gene cluster and astaxanthin biosynthetic pathway proposed 
at the gene level. J Bacteriol. 177:6575-84 

Miura Y, et al. (1998) Production of the carotenoids lycopene, beta-carotene, and 
15 astaxanthin in the food yeast Candida utilis. Appl Environ Microbiol. 64:1226-9 

Shanklin J, et al. (1997) Mossbauer studies of alkane omega-hydroxylase: evidence for 
a diiron cluster in an integral-membrane enzyme. Proc Natl Acad Sci USA. 94:2981-6 

Shanklin J, Cahoon EB (1998) Desaturation and related modifications of fatty acids. 
Ann Rev Plant Physiol Plant Mol Biol 49: 611-641 

20 Wang CW, et al. Engineered isoprenoid pathway enhances astaxanthin production in 
Escherichia coli. Biotechnol Bioeng. 1999 Jan 20;62(2):235-41. 
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I claim: 

1 . A method of producing a ketocarotenoid in a host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having ketolase enzyme activity and has the 
5 nucleic acid sequence of SEQ ID NO: 1 or 3, wherein the heterologous nucleic acid 
sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing 
the ketocarotenoid. 

2. The method of claim 1, wherein the host cell is selected from the group 
10 consisting of a bacterial cell, an algal cell and a plant cell. 

3. A method of producing a ketocarotenoid in a host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having ketolase enzyme activity and has a 
sequence which encodes the amino acid sequence of SEQ ID NO: 2 or 4, wherein the 
15 heterologous nucleic acid sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing 
the ketocarotenoid. 

4. The method of claim 3, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

20 5. A method of modifying the production of carotenoids in a host cell, relative to an 
untransformed host cell, the method comprising 

inserting into a host cell which already produces carotenoids a vector 
comprising a heterologous nucleic acid sequence which encodes for a protein having 
ketolase enzyme activity and has the nucleic acid sequence of SEQ ID NO: 1 or 3, 
25 wherein the heterologous nucleic acid sequence is operably linked to a promoter; and 
expressing the heterologous nucleic acid sequence in the host cell to 
modify the production of the carotenoids in the host cell, relative to an untransformed 
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host cell. 



6. The method of claim 5, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

7. A method of modifying the production of carotenoids in a host cell, relative to an 
5 untransformed host cell, the method comprising 

inserting into a host cell which already produces carotenoids a vector 
comprising a heterologous nucleic acid sequence which encodes for a protein having 
ketolase enzyme activity and has a sequence which encodes the amino acid sequence 
of SEQ ID NO: 2 or 4. wherein the heterologous nucleic acid sequence is operably 

1 0 linked to a promoter; and 

expressing the heterologous nucleic acid sequence in the host cell to 
modify the production of the carotenoids in the host cell, relative to an untransformed 
host cell. 

8. The method of claim 7, wherein the host cell is selected from the group 
15 consisting of a bacterial cell, an algal cell and a plant cell. 

9. A purified nucleic acid sequence which encodes for a protein having ketolase 
enzyme activity and has the nucleic acid sequence of SEQ ID NO: 1. 

10. A purified nucleic acid sequence which encodes for a protein having ketolase 
enzyme activity and has the nucleic acid sequence of SEQ ID NO: 3. 

20 11. A purified nucleic acid sequence which encodes for a protein having ketolase 
enzyme activity and has a sequence which encodes the amino acid sequence of SEQ 
ID NO: 2. 

12. A purified nucleic acid sequence which encodes for a protein having ketolase 
enzyme activity and has a sequence which encodes the amino acid sequence of SEQ 
25 ID NO: 4. 
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13. A vector which comprises the nucleic acid sequence of any one of claims 9-12, 
wherein the nucleic acid sequence is operably linked to a promoter. 

14. A host cell which is transformed with the vector of claim 1 3. 

15. The host cell of claim 14, wherein the host cell is selected from the group 
5 consisting of a bacterial cell, an algal cell and a plant cell. 

16. The host cell of claim 14, wherein the host cell is a photosynthetic cell. 

17. The host cell of claim 14, wherein the host cell contains a ketocarotenoid. 

18. The host cell of claim 14, wherein the host cell contains modified levels of 
carotenoids, relative to an untransformed host cell. 

10 19. A purified ketolase enzyme which is encoded by the amino acid sequence of 
SEQ ID NO: 2. 

20. A purified ketolase enzyme which is encoded by the amino acid sequence of 
SEQ ID NO: 4. 
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FIGURE 3 
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FIGURE 4 
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Figure 5 [SEQ ID NO: 5] 



-23 








ctocaaaaat 


teggcacgag 


1 


agcaacc-Cd 


gcgt tcagta 


C Cldy U LCtL Lv- 


tttccacaaq 


aatctcttgt 


51 


tgcaczcaaa 


acaagacatt 


*— »^ ^ ^ m y™^ /*•> 

CLcaaccs^^- 


r* =5 1- a 1 1 1 a c t 


cttctctcca 


101 


crt.z— egg ~gg 


agtcgectat 


gagaaagaaa 


cioy ci ci >— • a. w w 


atactqeatq 


151 




gt tgcagaga 


gaaCaayyaa 


ccttaatatt 


cctcaaattg 


201 


aagaagagga 


agagaacgag 


gaagaactaa 


L. d. y d. Q. o a y ci v- - 


aaattctqqc 


251 




eaaagaaaac 


oct acrggggg 




aa coat ccac 


301 


tgccrccaz- 


grcgcacccg 


tatcttgtc" 


t* anna f rrf t 




351 


gaccrgc-g- 


:tacttcaag 


*— ^ ^ 4"- ^» ^ /~T^"» '* H 

tuuicacuyu 




tagagatatt 


401 


cctgtcgcag 


aaatggggat 


tacgt uiyCu 


y Lu 1- I- ^y ^ 


ergctgegat 


451 


tggcacggaa 


tttttgtcag 


gacggguuca 


^» ci ci c*y c*. » w w 


tcracacQatt 


501 


crtrgzgg^a 


cattcacaag 


^- ^» a 25 ^ 55 


acitcacaaaa 


aqqccgcttc 


551 


gagz ica^g 






cranttCCtQ 

w ^ t» w «^ ^5 


etattgetet 


601 




ggautctcaa 


aLyaauuCu l. 


rrttcctaaa 


acctgetttg 

^ •— ' 


651 


gtaccggz zz 


tggaacgaca 


y it- ^y tyyi-a 


tagcttacat 


ttttcttcac 


701 


aatcccc"" - 




gttcccagta 


gggcttattg 


caaacgtccc 


751 


tcac ~ ~ ccac 


cagcrggcug 


cagctcacca 


aatccatcac 


tcaggaaaat 


801 


ttcagggcgr 


accatttggc 


ctgttccttg 


gaccccagga 


attggaagaa 


851 


gtaagaggag 


gcactgaaga 


attggagagg 


gtgatcagtc 


gtacagctaa 


901 


aegaaegcaa 


tcatctacat 


gaatcaactc 


ttttacattt 


atgaggtttt 


951 


agttcatcgg 


tgttacaagt 


cacacatttg 


tgtcgttgta 


gtaattcaaa 


1001 


gttaccarac 


tcttttttag 


aatttttttt 


tgatgtatag 


gtcgcggagt 


1051 


tacggttaca 


aaggccaaat 


ctattgttgt 


ggaattccat 


tattaaaaat 


1101 


aaaaat taga 


-gtttgtagtt 


ttatctggtg 


atcaatatca 


atatatatta 


1151 


attaaagcaa 


aaaaaaaaaa 


aaaaaa ctcgag 
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Figure 6 [SEQ ID NO: 6] 

MGIiQEFGTR 

aisvfstsys fhknlllhsk qdilnrpcll fspwvesptn rkkkthraac 
icsvaertm Idipqieeee eneeelieqt dsgiihikkt Iggkqsrrst 
gsivapvscl gilsmigpav yfkfsrlmec gdipvaemgi tfaafvaaai 
gteflsgwvh kelwhdslwy ihkshhrsrk grfefndvfa iinalpaial 
inygfsnagi Ipgacfgtgl gttvcgmayi flhnglshrr fpvglianvp 
yfhklaaahq ihhsgkfqgv pfglflgpqe leevrggtee iervisrtak 
rtqsst* 
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Figure 7 [SEQ ID NO: 7] 

ggg ctgcaggaat tcggcacgag 

1 agcaatttca gtgttcagtt caggctattc tttctacaag aatctcttgt 
51 tggactcaaa accaaatatt ctcaaacccc catgcctgct attctctcca 
101 gttgtgatca cgtcgcctat cagaaagaaa aagaaacatg gtgatccatg 
151 tatctgctcc cttgcaggga gaacaaggaa ccttzgatatt cctcaaattg 
201 aagaagagga agagaatgtg gaagaactaa tagaacagac cgattccgac 
251 atagtgcata taaagaaaac actagggggg aaacaatcaa aacggcccac 
301 tggctccarc gtcgcacccg tatcr.gtct tgggatcctt tcaatgactg 
351 gacccgcrg- "-tacttcaag "tzcacggc taatggaggg tggaga-ata 
401 cctgtagcag aaatggggat tacgrttgcc acctrrgtcg ctgctgczgt 
451 tagcacggag tttttgtcag catgggttca caaagaactc tggcacgagt 
501 ctttgrggta cattcacaag tctcaccatc ggtcacgaaa aggccgcntc 
551 aagtzcaatg atgtgtttgc tarzartaac gcgcttcccg ctattgctct 
601 tatcaattat ggattctcca acgaaggcct ccttcctgga gcgtgcrrtg 
651 gtgtcggcct tggaacaaca gzcrgtggta tggcttacat ttttctccac 
701 aatggcccat cacaccgaag gtrcccagta tggcttarcg cgaacgcccc 
751 ttatttccac aagctggctg cagccacca aatacaccac tcaggaaaat 
801 ttcagggcgt accatttggc csgttccttg gacccaagga attggaagaa 
851 gtaagaggag gcactgaaga gtrggagagg gtaatcagtc gtacaaccaa 
901 acgaacgcaa ccatctacct gaatcaattt ttttacatat ataaggrttt 
951 agtttatcgg tgttataaaa tcacacatcc gtatcgtttt agtaagccaa 
1001 agttaagata cttccttctt agaatatttt ttgatgtata ggtcgcggat 
1051 atactgttac actattcgct gtggaattcc attataaaaa aataaaaaaa 
1101 aaaaaaaaaa aa ctcgag 
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Figure 8 [SEQ ID NO: 8) 

MGLQEPGTR 

aisvfssgys fyknilldsk pnilkppcll fspwimspm rkkkkhgdpc 
icsvagrtrn laipqieeee enveelieqt dsdivhikkt lggkqskrpt 
gsivapvscl gilsmigpav yfkf srlmeg gdipvaemgi tf atfvaaav 
gteflsawvh keiwheslwy ihkshhrsrk grf ef ndvf a iinalpaial 
inygfsnegl Ipgacfgvgl gttvcgmayi fihnglshrr fpvwlianvp 
yfhklaaahq ihhsgkfqgv pfglflgpke leevrggtee lervisrttk 
rtqpst.* 
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Figure 9: Gap of SEQ ID NO: 9 and SEQ ID NO: 3 
1 agcaatctcaotQttcagtacaagttattctttccacaagaatctcttgt 50 

1 1 1 1 1 1 | M 1 1 1 1 1 11 1 1 II HIIIIIMII iiiiiiiiiiimi 

1 agcaatttcaatgttcagttcaggttattctttctacaagaatctcttgt 50 
51 tgcactcaaaacaagacattctcaaccgcccatgtttgctcttctctcca 100 

,i i! miii i i i i ii i mi i i hum mi iimmiii 

51 tggacrcaaaaccaaatattctcaaacccccatgcctgctattctctcca 100 
101 gctgzac"=agrcgccratgagaaagaaaaagacacatcgtgctgcatg 150 

mill ! | I II M M 1 1 M 1 1 M M 1 1 M I MM 111 I MM 

101 gttgtgaccargrcgcctatgagaaagaaaaagaaacatggtgatccatg 150 
151 tatctgctczcxttgcagagagaacaaggaaccttgatattcctcaaattg 200 

1 1 1 1 1 1 1 1 1 Ml MM 1 1 II 1 1 1 M M 1 1 1 1 M M 1 1 1 I 1 1 M M M I 

151* tatctgctcegttgcagggagaacaaggaaccttgatattectcaaattg 200 
201 aagaaoaocaagagaacgaggaagaactaatagaacagacggattctggc 250 

IIIIIIIMillMM I I Ml HIM I II IIMMIII I I MM I I 

201 aagaagaccaagagaatgtggaagaactaatagaacagaccgattctgac 250 
251 ataattcacataaagaaaacgctaggggggaaacaatcaagacggtccac 300 

HI | Mi Ml MM MM IMMMIMIIMIIIII MM MM 

251 atagtgcatataaagaaaacactaggggggaaacaatcaaaacggcccac 300 
301 tggctccartgtcgcacccgtatcttgtcttgggatcctttcaatgatcg 350 

, 1 1 1 1 1 1 1 !M 1 1 1 1 1 M i I II 1 1 1 1 1 ! I M 1 1 M 1 1 i HI M 1 1 i » I I 

301 tggctccatrgtcgcacccgtatcttgtcttgggatcctttcaatgattg 350 
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Figure 9 (cont.) 

351 gacctgctgtctacttcaagttttcacggctaatggagtgtggagatatt 400 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMIIIIIH 

351 gacctgctgtttaettcaagttttcacggctaatggagggtggagatata 400 
401 cctgrcgcagaaatggggattacgttrgccgcctrtgttgctgctgcgat 450 

urn i ii iiiini i iMii i M ii in in i iimi m mi I 

401 cctgtagcagaaatggggattacgtttgccacctttgttgctgctgctgt 450 
451 cogcacaaaarttttgccaggatgggtzcacaaagaaccctggcacgact 500 

Miiniii Milium iiiiiiiiiiiiiiiiiiiiiiiiiM i 

451 tggcacggagtttttgtcagcatgggttcacaaagaactctggcacgagt 500 

m 

501 cctcgtggtacattcacaagtctcaccataggtcacgaaaaggccgcttc 550 

UNI I i I IIIMM llllll II III Ml MIIIMIIIIIUMIIM 

501 ctttgrggtacattcacaagtctcaccatcggtcacgaaaaggccgcttc 550 
55^ gagttcaataatgtgtttgctattattaacgcgcttcctgctattgctct 600 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 ] I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIMIIIIM 

551 cagttcaatgacgtgtctgctattattaacgcgcttcccgctattgctct 600 
601 tatcaattatggattctcaaatgaaggcctccttcctggagcctgctttg 650 

Mllll IMMMIIMI imilllMMIIIIMIIIM IIIMM 

601 tatcaattatggattctccaatgaaggcctccttcctggagcgtgctttg 650 

m 

651 gtaccgatcttggaacgacagtctgtggcatggcttacatttttcttcac 700 

II iilllHIIIM 1 1 1 1 1 1 1 1 1 1 ! IMMIIIIMIUMMIM 

651 gtgtcggtcttggaacaacagtctgtggtatggcttacatttttcttcac 700 
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Figure 9 (cont.) 

701 aatggcccttcacaccgaaggttcccagtagggcttattgcaaacgtccc 750 

IIMIII! I II 1 1 1 1 I 1 1 1 1 1 MM! IN I I MINIM Mill III 

701 aacggcccaccacaccgaaggttcccagtatggcttattgcgaacgtccc 750 
751 ttac:"cacaagctggctgcagctcaccaaatccatcactcaggaaaat 800 

M | MM! MM M II I I I I MM MIIIM I 1 II IIMIII I I Ml I 

751 ttatt-.:=acaagccggc-.gcagctcaccaaatacaccactcaggaaaat 80.0 
801 ttcaagg-.gcaccactiggcc-gt-ccrrggaccccaggaatcggaagaa =50 

IIIIIMIIIIIIMIIIIIINIIIIIIIIIMI Mllllllllllll 

801 ttcagggcgcaccatrcggcccgttccttggacccaaggaattggaagaa 850 
_ 

851 gtaaaaggaggcactgaagaattggagagggtgatcagccgtacagctaa 900 

IIMIIIMMIIIIIIIII II I I M M M I IMIMMIMI Mill 

851 gtaagaggaggcactgaagagttggagagggtaatcagtcgtacaactaa 900 
901 acgaacgcaatcatctacaTGAatcaactcttttacatctatgaggtttt 950 

| || Ml MM MIIIM I II II Ml I Ml II II I Ml MIIIM 

901 acgaacgcaaccatctaccTGAatcaatttttttacatatataaggtttt 950 
951 agtttatcggcgtta.caagtcacacatttgtgtcgttgtagtaattcaa 999 

mllllllllllll H IIMIIII II Mill MUM MM 

951 agttratcggtgttataaaatcacacatccgtatcgttttagtaagtcaa 1000 

1000 agttaccatactcttttttagaatttttttttgatgtataggtcgcggag 1049 

HIM Mill M I M 1 1 II I M I I II II I M II II M I 

1001 agttaagatacttccttcttagaatattttttgatgtataggtcgcggat 1050 
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Figure 9 (cont.) 

1050 ttacagttacaaacgccaaatctattgttgtggaattccattattaaaaa 1099 

| | | "| | | | | I I I I I I I I I I I M I I I I I I i I I I I I 

1051 atactgrcac actattcgttgtggaattccattataaaaaa 1091 

1100 taaaaac-.agagu-.gtagtttcatctggtgatcaatatcaatatatatt 1149 

I I I I I I 
1092 ataaaaaaaaaaaaaaaaaaa 
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Figure 10: Gap of SEQ ID NO: 2 and SEQ ID NO: 4 

i A I S VF S T S Y S FHKNL>LLH S KQD I LNRP CLL>F S P VWE S PMRKKKTHRAAC 50 

" M[ i*| | . Mhlllll II -II IMIMIIh IIIMII I I 

1 AISVFSSC-YSFYKNLLLDSKPNILKPPCLLFSPWIMSPMRKKKKHGDPC 50 
51 I C S VAEETRNLD I ?Q I EEEEENEEEL I EQTDS G 1 1 HI KKTLGGKQ S RRST 100 

Mill MMIIillMMIII I II I! II 1 1 hllllllllllhl I 

51 I CSVAGRTHMLDI PQI EESEENVEELIEQTDSDI VHI KKTLGGKQSKRPT 100 
101 GSIVAPVS ZLGILSMIGPAVYFKFSRLKECGDIPVAEMGITFAAFVAAAI 150 

iMMii::!iiiiiiiiiiiiiiui!! ii i mi ii mi i him-. 

101 GS I VAPVS CLGILSMIGPAVYFKFSRLMSGGDI PVAEMGI TFATFVAAAV 150 
151 GTEFLSGW\ 7 HKELWHDSLWYIHKSHKRSRKGRFEFNDVFAI INALPAIAL 200 

hum MiiiiihiiiiiiiiiiimiiMiiiiiiiiiiiiMii 

151 GTEFLSAVTv--iKSLWHESLWYIHKSHHRSRKGRFEFNDVFAIINALPAIAL 200 
201 INYGFSNEC-LLPGACFGTGLGTTVCGMAYIFLHNGLSHRRFPVGLIANVP 250 

IMIIIIiMMIIIM IIIII'IIIMIMIIIIIIIIHII II till 

201 INYGFSNE3LLPGACFGVGLGTTVCGMAYIFI.HNGLSHRRFPWLIANVP 250 
251 YFHKLA^AKQIHHSGKFQGVPFGLFLGPQELEEVRGGTEELERVTSRTAK 300 

1 1 1 1| 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 ■ 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

251 YFHKIAAAHQIHHSGKFQGVPFGLFLGPKSLSEVRGGTEELERVISRTTK 300 

301 RTQSST* 3 07 • 

III Ml 
301 RTQPST* 3 07 
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Figure 11: Gap of SEQ ID NO: 2 and Arabidopsis 0-carotene hydroxylase (SEQ 
ID NO: 10) 

1 AISVFSTSYSFHKNLLLHSKQDILNRPCLLF5PVVVESPMRKKKTHRAAC 50 

. ||. -I II ..II- : I * 

1 KAAXLSTAVTFKP. . . LHRSFSSSSTDFRLRLPKSLSGFSPSLRFKRFSV 47 

51 icSVAERTRNIJDIPQIEEEEENEEELIECTDSGIIHIKKTLGGKQSRRST 10 0 

i || -| I II : : : - I l-l III 

4 8 CYWEZF.RQNS P I ENDERPESTS STNAIDAEYLALRLAEKLERKKSERST 97 



101 GS I VA? V5 CLG IL.S.MI GPAVYFKF SRLKECGD I PVAEMGI TFAAFVAAAI 150 
I I -| || II llh: I I II hi • M Ml IN: 
98 YLIAAML.S S FG I TSMAVMAVYYRF SWQMEGGE I SML.EMFGTFALS VGAAV 147 

151 GTE F LS GWVHKEL WHDS LWY I HKS EKR S RKGRFE FND VF AI I NALP A I AL 200 

| || . i |: Ml IN -1-111= l-l II MM I hi I Ml I 
148 GMEFKARWAHRALWHASLWNMHESHHKPRSGPFELNDVFAIVNAGPAIGL 197 

201 INYGFSNEGLLPGACFGTGLGTTVCGKAYIFI.HNGLSHRRFPVGLIANVP 250 

:.||| hlhll III III M I - I I - I - I ' I I MINI Ihll 
198 LS YGFFNKGLVPGLCFGAGLG I TVFG I AYMFVHDGLVHKRF PVGP I ADVP 247 

251 YFHKLAAAHQIHHSGKFQGVPFGLFLGPQELEEVRGGTEELERVTSRTAK 300 

I hllllhlh II MhlllllhlllM II Mh: Ml I 

248 YLRKVAAAHQLHHTDKFNGVPYGLFLGPKELEEV . GGNEELDKEI SRRIK 296 

3 01 RTQSST* 3 07 

297 SYKKASGSGSSSSS* 311 
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Figure 12A (SEQ ID NO: 11) 

I C AT ACC AT AA ATAGTAGAGG ACAACCTACA AACCAACCAC CAGAAACCTC 50 
51 CAATGGCAC 



Figure 12B (SEQ ID NO: 12) 
MAAAI SVFSSGYS FYXNLLLDS KPNILKP PCLLF 5 PVVTMS ? MRKXK2C-IGD ? C I C S VAC-R 

thnzz : ?c- : zszzznvszl i fqtdsd i vhz kxtzggkqs krptgs i vapv'sclg; lsmig 

PAVYFI-IF 5 P-LMS GGD I PVAEMGITFATFVAAAVGTZF 1,5 AWVEKSLWEESLWYIHXSEH3. 
SJtXGF-FFFNDVFAI INALPAIAIilNYGFSNEGI^PGACFGVGLGTTVCGMAYI FLHNGLS 
E2^?VWLIANV?Y7E<IAAAEQIKHSGXF^ 
TTXF.TQFST* 



BNSDOCID: <WO 9961652A1_I_> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/61652 



16/16 



PCT/US99/10455 



Figure 13 



Atl 

At2 

Cal 

Ca2 

AdKl 

AdK6 



*fkr 



* 20 * * 60 

MXAXLSTAVTFKPLHRSPSSSSTDFRLRI»PKslsgfspslR 

-MAAGLSTIAVTIJ^LNRSSFSANHPXstavfppslRFNGFRR rki 

MAABISISASSWlICLQMIPFPAPKyFATAPpllffspltCOTJDAIIASIlRlcpr JaacQvlk : 

TTGRraQLVWCQISFSSTSRTSYYRH^ s 
TTGRxHxQI*yw^ y ^ ^ ^ AIS ypgYSH raJQ,L 4 HSK Q 0 l IlNR ^B 3 "^ * sp wfesfiMRKXXT - hxaali cBvae 

MAAAISVFSSGSFYKin^SKPNaLKPpJllf spwgMSBMRKKKK-HgdpqicBvag 

" a ~ - F«P c v 



52 
53 
62 
71 
56 
59 



Atl 



80 * 100 

: errqNSPlENDERPESTSSTNAIDABYIAL- 



At2 : erkqSSPMDDDNKPESTTSSSEII-MTS- 

Cal : ddklYTAQSGKQSDTEAIGDEIEVETNEEKSIA^ 

Ca2 : ddkfKTQFEAGESDIEMKXBSQISAT— • \ 

AdKl : : 

AdK6 : : 



jEQTDSGII- 
QTDSDIV- 



140 




160 



180 



Atl 




ffimegg 


At2 




ffimkgg 


Cal 




ffimagg 


Ca2 




HKmegg 


AdKl 




Hmecg 


AdK6 




flmegg 

MagG 



rsvlaafigtfai sVgaawg-jef, 
£3 em 4ctfa| afg-aaigge; 
) f s azfig t f ajjs vg - aa vgSo f j 
Itfaalva-aaiglef 
|t£at§va-aavg|ef 
TFA v Aa G S£ 




lwhasl 
IwhdalvmJ 
lwbaslwbf; 
lwhaslw! 
jlwbdsl 
|lwh.aali 
LWH SIM 




* 220 

f alndvf aivnagpai : 195 

afalndvf aitnavpai : 194 

feladifaiinavpai : 209 

f alndvf aiinavpai s 210 

f afndvf aiinalpai : 198 

f afndvf aiiaalpai z 201 
G FB NDvFAI HA PAX 



Atl 

At2 

Cal 

Ca2 

AdKl 

AdK6 



: gllsygffr 

: gllyygflr 

: affsfgfr 

: alldygffi 

: alinygfsr 

: alinygfsi 
1 yGF 



260 




GI* PG CFG GI*G Tv GzaAY F 



* 280 
Jr fpvgHladvpy 1 r kSaaalu 
rf pvgS ianvpylrkHaaaiii 
:t pvgfi iakvpyf qrfflaaain 
Srfpvgfi vanvpylrkSaaaiisj 
Ir £ pvgSi anvpy £ hklaaain 
Jr f pvCJianvpy f hJc^aaalK 
RFPVg iA VPY k AAAHq HH 




300 * 
Atl : glflgpkeleavgg-nael 
At2 : glflgpkqavaavgGkaal 
Cal : glflgpkalaavgv-iaal 
Ca2 : glflgpkelaavgg-laal 
AdKl : glflgpqaloavjgGteal 
AdK6 : glflgpkalaavJgGtaal 
GLFLGPkalaBv g 



320 * 

isrfSiksykkaSGSGSSSSS : 310 

iaJsiklynkgSSTS : 305 

ikslkrl — - 315 

xgtryikgs t 316 

qssT s 30 6 

[qpsT : 309 
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SEQUENCE LISTING 



<110> CUNNINGHAM, Francis X. 

<120> CAROTENOID KE70LASE GENES AND GENE PRODUCTS , PRODUCTION 
OF KETOCAROTENOIDS AND METHODS OF MODIFYING CAROTENOIDS 
USING THE GENES 

<130> 8172-9022 

<140> Unknown 
<141> 1999-05-21 

<150> 60/086,460 
<151> 1998-05-22 

<160> 18 

<170> Patentln Ver . 2.0 

<210> 1 
<211> 1176 
<212> DNA 

<213> Adonis aestivalis 
<400> 1 

agcaatctca gtgttcagta caagttattc tttccacaag aatctcttgt tgcactcaaa 60 

acaagacatt ctcaaccgcc catgtttgct cttctctcca gttgtggtgg agtcgcctat 120 

gagaaagaaa aagacacatc gtgctgcatg tatctgctct gttgcagaga gaacaaggaa 180 

ccttgatatt cctcaaatta aagaagagga agagaacgag gaagaactaa tagaacagac 240 

ggattctggc ataattcata taaagaaaac gctagggggg aaacaatcaa gacggtccac 300 

tggctccatt gtcgcacccg tatcttgtct tgggatcctt tcaatgatcg gacctgctgt 360 

ttacttcaag ttttcacggc taatggagtg tggagatatt cctgtcgcag aaatggggat 420 

tacgtttgcc gcctttgttg ctgctgcgat tggcacggaa tttttgtcag gatgggttca 480 

caaagaactc tggcacgart ctttgtggta cattcacaag tctcaccata ggtcacgaaa 540 

aggccgcttc gagttcaarg atgtgtttgc tattattaac gcgcttcctg ctattgctct 600 

tatcaattat ggattctcaa atgaaggcct ccttcctgga gcctgctttg gtaccggtct 660 

tggaacgaca gtctgtggca tggcttacat ttttcttcac aatggccttt cacaccgaag 720 

gttcccagta gggcttatrg caaacgtccc ttatttccac aagctggctg cagctcacca 780 

aatccatcac tcaggaaaat ttcagggtgt accatttggc ctgttccttg gaccccagga 840 

attggaagaa gtaagaggag gcactgaaga attggagagg gtgatcagtc gtacagctaa 900 

acgaacgcaa tcatctaca: gaatcaactc ttttacattt atgaggtttt agtttatcgg 960 

tgttacaagt cacacatttg tgtcgttgta gtaattcaaa gttaccatac tcttttttag 1020 
aatttttttt tgatgtatag gtcgcggagt tacggttaca aaggccaaat ctattgttgt 1080 
ggaattccat tattaaaaat aaaaattaga gtttgtagtt ttatctggtg atcaatatca 1140 

atatatatta attaaagcaa aaaaaaaaaa aaaaaa 1176 

<210> 2 

1 
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<211> 306 
<212> PRT 

<213> Adonis aestivalis 
<400> 2 

Ala lie Ser Val Phe Ser Thr Ser Tyr Ser Phe His Lys Asn Leu Leu 
1 5 10 15 

Leu His Ser Lys Gin Asp lie Leu Asn Arg Pro Cys Leu Leu Phe Ser 
20 25 30 

rrD VaJ Val Va.i Giu Ser Pro Met Arg Lys Lys Lys Thr His Arg Ala 
35 40 45 

Ala Cyr Zle Cys Ser Val Ala Glu Arg Thr Arg Asn Leu Asp lie Pro 
50 55 60 

Gin lie Giu Glu Glu Giu Glu Asn Glu Glu Glu Leu lie Glu Gin Thr 
65 "70 75 80 

Asp Ser Giy lie lie His lie Lys Lys Thr Leu Gly Gly Lys Gin Ser 

85 90 95 

Arg Arg Ser Thr Gly Ser lie Val Ala Pro Val Ser Cys Leu Gly lie 
100 105 110 

Leu Ser Mei lie Gly Pro Ala Val Tyr Phe Lys Phe Ser Arg Leu Met 
115 120 125 

Glu Cys Giy Asp lie Pro Val Ala Glu Met Gly lie Thr Phe Ala Ala 
130 135 140 

Phe Val Ala Ala Ala lie Gly Thr Glu Phe Leu Ser Giy Trp Val His 
145 150 155 160 

Lys Glu Leu Trp His Asp Ser Leu Trp Tyr lie His Lys Ser His His 
165 170 175 

Arg Ser Arc Lys Gly Arg Phe Glu Phe Asn Asp Val Phe Ala lie lie 
180 185 190 

Asn Ala Leu Pro Ala He Ala Leu He Asn Tyr Gly Phe Ser Asn Glu 
195 200 205 

Gly Leu Leu Pro Gly Ala Cys Phe Gly Thr Gly Leu Gly Thr Thr Val 
210 215 220 

Cys Gly Met A.ia Tyr lie Phe Leu His Asn Gly Leu Ser His Arg Arg 

2 
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Phe Pro Val Gly Leu lie Ala Asn Val Pro Tyr Phe His Lys Leu Ala 



Ala Ala His Gin He His His Ser 
260 

Gly Leu Phe Leu Gly Pro Gin Glu 

275 280 

Glu Giu Leu Glu Arc Val lie Ser 
290 295 

Ser Thr 
305 



Gly Lys Phe Gin Gly Val Pro Phe 
265 270 

Leu Glu Glu Val Arg Gly Gly Thr 
285 

Arg Thr Ala Lys Arg Thr Gin Ser 
300 



<210> 3 
<211> 1112 
<212> DNA 

<213> Adonis aestivalis 
<400> 3 

agcaatttca gtgttcagtt caggttattc tttctacaag aatctcttgt tggactcaaa 60 

accaaatatt ctcaaacccc catgcctgct attctctcca gttgtgatca tgtcgcctat 120 

gagaaagaaa aaqaaacatg gtgatccatg tatctgctcc gttgcaggga gaacaaggaa 180 

ccttgatatt cctcaaattg aagaagagga agagaatgtg gaagaactaa tagaacagac 240 

cgattctgac atagtgcata taaagaaaac actagggggg aaacaatcaa aacggcccac 300 

tggctccatt gtcgcacccg tatcttgtct tgggatcctt tcaatgattg gacctgctgt 360 

ttacttcaag ttttcacggc taatggaggg tggagatata cctgtagcag aaatggggat 420 

tacgtttgcc acctttgttg ctgctgctgt tggcacggag tttttgtcag catgggttca 480 

caaagaactc tggcacgagt ctttgtggta cattcacaag tctcaccatc ggtcacgaaa 540 

aggccgcttc ga.gttcaatg atgtgtttgc tattattaac gcgcttcccg ctattgctct 600 

tatcaattat ggattctcca atgaaggcct ccttcctgga gcgtgctttg gtgtcggtct 660 

tggaacaaca gtctgtggta tggcttacat ttttcttcac aatggcctat cacaccgaag 720 

gttcccagta tggcttattg cgaacgtccc ttatttccac aagctggctg cagctcacca 780 

aatacaccac tcaggaaaat ttcagggtgt accatttggc ctgttccttg gacccaagga 840 

attggaagaa gtaagaggag gcactgaaga gttggagagg gtaatcagtc gtacaactaa 900 

acgaacgcaa ccatctacct gaatcaattt ttttacatat ataaggtttt agtttatcgg 960 

tgttataaaa tcacacatcc gtatcgtttt agtaagtcaa agttaagata cttccttctt 1020 

agaatatttt ttgatgtata ggtcgcggat atactgttac actattcgtt gtggaattcc 1080 

attataaaaa aataaaaaaa aaaaaaaaaa aa 1112 

<210> 4 
<211> 306 
<212> PRT 

<213> Adonis aestivalis 
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<400> 4 

Ala lie Ser Val Phe Ser Ser Gly Tyr Ser Phe Tyr Lys Asn Leu Leu 
15 10 15 

Leu Asp Ser Lys Pro Asn lie Leu Lys Pro Pro Cys Leu Leu Phe Ser 
20 25 30 

Pro Val Val lie Met Ser Pro Met Arg Lys Lys Lys Lys His Gly Asp 
35 40 45 

Pro Cys lie Cys Ser Val Ala Gly Arg Thr Arg Asn Leu Asp He Pro 
50 55 60 

Gin He Glu Glu Glu Glu Glu Asn Val Glu Glu Leu He Glu Gin Thr 
65 70 75 80 

Asp Ser Asp He Val His He Lys Lys Thr Leu Gly Gly Lys Gin Ser 

85 90 95 

Lys Arg Pro Thr Gly Ser He Val Ala Pro Val Ser Cys Leu Gly He 
100 105 110 

Leu Ser Met He Gly Pro Ala Val Tyr Phe Lys Phe Ser Arg Leu Met 
115 120 125 

Glu Gly Gly Asp He Pro Val Ala Glu Met Gly He Thr Phe Ala Thr 
130 135 140 

Phe Val Ala Ala Ala Val Gly Thr Glu Phe Leu Ser Ala Trp Val His 
145 150 155 160 

Lys Glu Leu Trp His Glu Ser Leu Trp Tyr He His Lys Ser His His 
165 170 175 

Arg Ser Arg Lys Gly Arg Phe Glu Phe Asn Asp Val Phe Ala lie He 
180 185 190 

Asn Ala Leu Pro Ala He Ala Leu He Asn Tyr Gly Phe Ser Asn Glu 
195 200 205 

Gly Leu Leu Pro Gly Ala Cys Phe Gly Val Gly Leu Gly Thr Thr Val 
210 215 220 

Cys Gly Met Ala Tyr He Phe Leu His Asn Gly Leu Ser His Arg Arg 
225 230 235 240 

Phe Pro Val Trp Leu He Ala Asn Val Pro Tyr Phe His Lys Leu Ala 

4 
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Ala Ala His Gin lie His His Ser Gly Lys Phe Gin Gly Val Pro Phe 
260 265 270 

Gly Leu Phe Leu Gly Pro Lys Glu Leu Glu Glu Val Arg Gly Gly Thr 
275 280 285 

Glu Glu Leu Glu Arg Val lie Ser Arg Thr Thr Lys Arg Thr Gin Pro 
290 295 300 

Ser Thr 
305 



<210> 5 
<211> 1205 
<212> DNA 

<213> Adonis aestivalis 
<400> 5 

gggctgcagg aattcggcac gagagcaatc 
aagaatctct tgttgcactc aaaacaagac 
ccagttgtgg tggagtcgcc tatgagaaag 
tctgttgcag agagaacaag gaaccttgat 
gaggaagaac taatagaaca gacggattct 
gggaaacaat caagacgg:: cactggctcc 
ctttcaatga tcggacctcc tgtttacttc 
attcctgtcg cagaaatgcg gattacgttt 
gaatttttgt caggatgggt tcacaaagaa 
aagtctcacc ataggtcacg aaaaggccgc 
aacgcgcttc ctgctattgc tcttatcaat 
ggagcctgct ttggtaccgg tcttggaacg 
cacaatggcc tttcacacc; aaggttccca 
cacaagctgg ctgcagctca ccaaatccat 
ggcctgttcc ttggacccca ggaattggaa 
agggtgatca gtcgtacac: taaacgaacg 
tttatgaggt tttagttta- cggtgttaca 
aaagttacca tactcttt:: tagaattttt 
acaaaggcca aatctattg- tgtggaattc 
gttttatctg gtgatcaaia tcaatatata 
tcgag 

<210> 6 

<211> 315 

<212> PRT 

<213> Adonis aestivalis 



tcagtgttca gtacaagtta ttctttccac 60 
attctcaacc gcccatgttt gctcttctct 120 
aaaaagacac atcgtgctgc atgtatctgc 180 
attcctcaaa ttgaagaaga ggaagagaac 240 
ggcataattc atataaagaa aacgctaggg 300 
attgtcgcac ccgtatcttg tcttgggatc 360 
aagttttcac ggctaatgga gtgtggagat 420 
gccgcctttg ttgctgctgc gattggcacg 4 80 
ctctggcacg attctttgtg gtacattcac 540 
ttcgagttca atgatgtgtt tgctattatt 600 
tatggattct caaatgaagg cctccttcct 660 
acagtctgtg gcatggctta catttttctt 720 
gtagggctta ttgcaaacgt cccttatttc 780 
cactcaggaa aatttcaggg tgtaccattt 840 
gaagtaagag gaggcactga agaattggag 900 
caatcatcta catgaatcaa ctcttttaca 960 
agtcacacat ttgtgtcgtt gtagtaattc 1020 
ttttgatgta taggtcgcgg agttacggtt 1080 
cattattaaa aataaaaatt agagtttgta 1140 
ttaattaaag caaaaaaaaa aaaaaaaaac 1200 

1205 



5 
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<400> 6 

Met Gly Leu Gin Glu Phe Gly Thr Arg Ala lie Ser Val Phe Ser Thr 
15 10 15 

Ser Tyr Ser Phe His Lys Asn Leu Leu Leu His Ser Lys Gin Asp lie 
20 25 30 

Leu Asn Arg Pro Cys Leu Leu Phe Ser Pro Val Val Val Glu Ser Pro 
35 40 45 

Met Arg Lys Lys Lys Thr His Arg Ala Ala Cys lie Cys Ser Val Ala 
50 55 60 

Glu Arg Thr Arg Asn Leu Asp lie Pro Gin lie Glu Glu Glu Glu Glu 
65 70 75 80 

Asn Glu Glu Glu Leu He Glu Gin Thr Asp Ser Gly He He His lie 

85 90 95 

Lys Lys Thr Leu Gly Gly Lys Gin Ser Arg Arg Ser Thr Gly Ser He 
100 105 110 

Val Ala Pro Val Ser Cys Leu Gly He Leu Ser Met He Gly Pro Ala 
115 120 125 

Val Tyr Phe Lys Phe Ser Arg Leu Met Glu Cys Gly Asp He Pro Val 
130 135 140 

Ala Glu Met Gly lie Thr Phe Ala Ala Phe Val Ala Ala Ala He Gly 
145 150 155 160 

Thr Glu Phe Leu Ser Gly Trp Val His Lys Glu Leu Trp His Asp Ser 
165 170 175 

Leu Trp Tyr lie His Lys Ser His His Arg Ser Arg Lys Gly Arg Phe 
180 185 190 

Glu Phe Asn Asp Val Phe Ala He lie Asn Ala Leu Pro Ala lie Ala 
195 200 205 

Leu He Asn Tyr Gly Phe Ser Asn Glu Gly Leu Leu Pro Gly Ala Cys 
210 215 220 

Phe Gly Thr Gly Leu Gly Thr Thr Val Cys Gly Met Ala Tyr lie Phe 
225 230 235 240 

Leu His Asn Gly Leu Ser His Arg Arg Phe Pro Val Gly Leu lie Ala 
245 250 255 



BNSOOCIO. <WO 9961652A1 J_> 



WO 99/61652 PCT/US99/10455 



Asn Val Pro Tyr Phe His Lys Leu Ala Ala Ala His Gin lie His His 
260 265 270 

Ser Gly Lys Phe Gin Gly Val Pro Phe Gly Leu Phe Leu Gly Pro Gin 
275 280 285 

Glu Leu Glu Glu Val Arg Gly Gly Thr Glu Glu Leu Glu Arg Val lie 
290 295 300 

Ser Arg Tn: Ala Lys Arg Thr Gin Ser Ser Thr 
305 .310 315 



<210> 7 
<211> 114: 
<212> DNA 

<213> Ador.:.-. aestivalis 
<400> 7 

gggctgcago aattcgqcac gagagcaatt tcagtgttca gttcaggtta ttctttctac 60 

aagaatctct tgttggactc aaaaccaaat attctcaaac ccccatgcct gctattctct 120 

ccagttgtga tcatgtcgcc tatgagaaag aaaaagaaac atggtgatcc atgtatctgc 180 

tccgttgcag ggagaacaag gaaccttgat attcctcaaa ttgaagaaga ggaagagaat 240 

gtggaagaac caatagaaca gaccgattct gacatagtgc atataaagaa aacactaggg 300 

gggaaacaat caaaacggcc cactggctcc attgtcgcac ccgtatcttg tcttgggatc 360 

ctttcaatga ttggacctcc tgtttacttc aagttttcac ggctaatgga gggtggagat 420 

atacctgtag cagaaatggg gattacgttt gccacctttg ttgctgctgc tgttggcacg 480 

gagtttttgt cagcatgggt tcacaaagaa ctctggcacg agtctttgtg gtacattcac 540 

aagtctcacc atcggtcacg aaaaggccgc ttcgagttca atgatgtgtt tgctattatt 600 

aacgcgcttc ccgctattgc tcttatcaat tatggattct ccaatgaagg cctccttcct 660 

ggagcgtgct ttggtgtcgg tcttggaaca acagtctgtg gtatggctta catttttctt 720 

cacaatggcc latcacaccg aaggttccca gtatggctta ttgcgaacgt cccttatttc 780 

cacaagctgg crgcagctca ccaaatacac cactcaggaa aatttcaggg tgtaccattt 840 

ggcctgttcc tiggacccaa ggaattggaa gaagtaagag gaggcactga agagttggag 900 

agggtaatca gtcgtacaac taaacgaacg caaccatcta cctgaatcaa tttttttaca 960 

tatataaggt tttagtttat cggtgttata aaatcacaca tccgtatcgt tttagtaagt 1020 

caaagttaag atacttccti cttagaatat tttttgatgt ataggtcgcg gatatactgt 1080 

tacactattc gttgtggaat tccattataa aaaaataaaa aaaaaaaaaa aaaaactcga 1140 

g " " 1141 

<210> 8 
<211> 315 
<212> PRT 

<213> Adonis aestivalis 
<400> 8 

Met Gly Leu Gin Glu Phe Gly Thr Arg Ala lie Ser Val Phe Ser Ser 
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Gly Tyr Ser Phe Tyr Lys Asn Leu Leu Leu Asp Ser Lys Pro Asn lie 
20 25 30 

Leu Lys Pro Pro Cys Leu Leu Phe Ser Pro Val Val lie Met Ser Pro 
35 40 45 

Met Arg Lys Lys Lys Lys His Gly Asp Pro Cys lie Cys Ser Val Ala 
50 55 60 

Gly Arg Thr Arg Asn Leu Asp lie Pro Gin lie Glu Glu Glu Glu Glu 
65 70 75 80 

Asn Val Glu Glu Leu lie Glu Gin Thr Asp Ser Asp lie Val His lie 

85 90 95 

Lys Lys Thr Leu Giy Gly Lys Gin Ser Lys Arg Pro Thr Gly Ser lie 
100 105 110 

Val Ala Pro Val Ser Cys Leu Gly lie Leu Ser Met lie Gly Pro Ala 
115 120 125 

Val Tyr Phe Lys Phe Ser Arg Leu Met Glu Gly Gly Asp lie Pro Val 
130 135 140 

Ala Glu Met Gly lie Thr Phe Ala Thr Phe Val Ala Ala Ala Val Gly 
145 ' 150 155 160 

Thr Glu Phe Leu Ser Ala Trp Val His Lys Glu Leu Trp His Glu Ser 
165 170 175 

Leu Trp Tyr lie His Lys Ser His His Arg Ser Arg Lys Gly Arg Phe 
180 185 190 

Glu Phe Asn Asp Val Phe Ala lie lie Asn Ala Leu Pro Ala lie Ala 
195 200 205 

Leu lie Asn Tyr Gly ?he Ser Asn Glu Gly Leu Leu Pro Gly Ala Cys 
210 215 220 

Phe Gly Val Gly Leu Gly Thr Thr Val Cys Gly Met Ala Tyr lie Phe 
225 230 235 240 

Leu His Asn Gly Leu Ser His Arg Arg Phe Pro Val Trp Leu lie Ala 
245 250 255 

Asn Val Pro Tyr Phe His Lys Leu Ala Ala Ala His Gin lie His His 

8 
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260 265 270 

Ser Gly Lys Phe Gin Gly Val Pro Phe Gly Leu Phe Leu Gly Pro Lys 
275 280 285 

Glu Leu Glu Glu Val Arg Gly Gly Thr Glu Glu Leu Glu Arg Val lie 
290 295 300 

Ser Arg Thr Thr Lys Arg Thr Gin Pro Ser Thr 
305 310 315 



<210> 9 
<211> 1149 
<212> DMA 

<213> Adonis aestivalis 
<400> 9 

agcaatctcc gtgtccag-= caagttattc tttccacaag aatctcttgt tgcactcaaa 60 
acaagacatt ctcaaccccz catgtttgct cttctctcca gttgtggtgg agtcgcctat 120 
gagaaagaaa aagacaca:: gtgctgcatg tatctgctct gttgcagaga gaacaaggaa 180 
ccttgatatt cctcaaatic aagaagagga agagaacgag gaagaactaa tagaacagac 240 
ggattctggc ataaitcara taaagaaaac gctagggggg aaacaatcaa gacggtccac 300 
tggctccatt gtcgcacccc tatcttgtct tgggatcctt tcaatgatcg gacctgctgt 360 
ttacttcaag ttttcacggc taatggagtg tggagatatt cctgtcgcag aaatggggat 420 
tacgtttgcc gcctttgttg ctgctgcgat tggcacggaa tttttgtcag gatgggttca 480 
caaagaactc tagcacgai: ctttgtggta cattcacaag tctcaccata ggtcacgaaa 540 
aggccgcttc gagttcaaic atgtgtttgc tattattaac gcgcttcctg ctattgctct 600 
tatcaattat ggattctcaa atgaaggcct ccttcctgga gcctgctttg gtaccggtct 660 
tggaacgaca gtctgtggca tggcttacat ttttcttcac aatggccttt cacaccgaag 720 
gttcccagta gggcttatiig caaacgtccc ttatttccac aagctggctg cagctcacca 780 
aatccatcac tcaggaaaar trcagggtgt accatttggc ctgttccttg gaccccagga 840 
attggaagaa gtaagaggag gcactgaaga attggagagg gtgatcagtc gtacagctaa 900 
acgaacgcaa tcatctaca: gaatcaactc ttttacattt atgaggtttt agtttatcgg 960 
tgttacaagt cacacattig tgtcgttgta gtaattcaaa gttaccatac tcttttttag 1020 
aatttttttt tgatgtatag gtcgcggagt tacggttaca aaggccaaat ctattgttgt 1080 
ggaattccat tattaaaaa: aaaaattaga gtttgtagtt ttatctggtg atcaatatca 1140 
atatatatt ' 1149 

<210> 10 

<211> 310 

<212> PRT 

<213> Arabidopsis 

<400> 10 

Met Ala Ala Xaa Leu Ser Thr Ala Val Thr Phe Lys Pro Leu His Arg 
15 10 15 
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Ser Phe Ser Ser SeWfr Thr Asp Phe Arg Leu Arg Leu Pf^^ys Ser 

20 25 30 

Leu Ser Gly Phe Ser Pro Ser Leu Arg Phe Lys Arg Phe Ser Val Cys 
35 40 45 

Tyr Val Val Glu Glu Arg Arg Gin Asn Ser Pro lie Glu Asn Asp Glu 
50 55 60 

Arg Pro Glu Ser Thr Ser Ser Thr Asn Ala lie Asp Ala Glu Tyr Leu 
65 70 75 80 

Ala Leu A: j Leu Ala Glu Lys Leu Glu Arg Lys Lys Ser Glu Arg Ser 

85 90 95 

Thr Tyr Lej lie Ai^ Ala Met Leu Ser Ser Phe Gly lie Thr Ser Met 
100 105 110 

Ala Val Mc". Ala Yil Tyr Tyr Arg Phe Ser Trp Gin Met Glu Gly Gly 
lib 120 125 

Glu lie Ser Met Leu Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly 
130 135 140 

Ala Ala Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Arg Ala Leu 
145 150 155 160 

Trp His Ala Ser Leu Trp Asn Met His Glu Ser His His Lys Pro Arg 
165 170 175 

Glu Gly Pro Phe Glu Leu Asn Asp Val Phe Ala lie Val Asn Ala Gly 
180 185 190 

Pro Ala lie Gly Leu Leu Ser Tyr Gly Phe Phe Asn Lys Gly Leu Val 
195 200 205 

Pro Gly Leu Cys Phe Gly Ala Gly Leu Gly lie Thr Val Phe Gly lie 
210 215 220 

Ala Tyr Met Phe Val His Asp Gly Leu Val His Lys Arg Phe Pro Val 
225 230 235 240 

Gly Pro lie Ala Asp Val Pro Tyr Leu Arg Lys Val Ala Ala Ala His 
245 250 255 

Gin Leu His His Thr Asp Lys Phe Asn Gly Val Pro Tyr Gly Leu Phe 
260 265 270 
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Leu Gly Pro Lys Glu Leu Glu Glu Val Gly Gly Asn Glu Glu Leu Asp 
275 280 285 

Lys Glu He Ser Arg Arg He Lys Ser Tyr Lys Lys Ala Ser Gly Ser 
290 295 300 

Gly Ser Ser Ser Ser Ser 

305 310 



<?.10> 1: 
•-111- • DMA 

< 7: i / • A i! 'j:.:s aestivalis 
<400> 11 

cataccataa atagtagacg acaacctaca aaccaaccac cagaaacctc caatggcagc 60 

<210> 12 
<211> 30^ 
<212> PRT 

<213> Adonis aestivalis 
<400> 12 

Met Ala Ala Ala He Ser Val Phe Ser Ser Gly Tyr Ser Phe Tyr Lys 
1 5 10 15 

Asn Leu Leu Leu Asp Ser Lys Pro Asn lie Leu Lys Pro Pro Cys Leu 
20 25 30 

Leu Phe Ser Pro Val Val He Met Ser Pro Met Arg Lys Lys Lys Lys 
35 40 45 

His Gly Asp Pro Cys He Cys Ser Val Ala Gly Arg Thr Arg Asn Leu 
50 55 60 

Asp He Pro Gin lie Glu Glu Glu Glu Glu Asn Val Glu Glu Leu He 
65 70 75 80 

Glu Gin Thr Asp Ser Asp lie Val His lie Lys Lys Thr Leu Gly Gly 
85 90 95 

Lys Gin Ser Lys Arg Pro Thr Gly Ser lie Val Ala Pro Val Ser Cys 
100 105 110 

Leu Gly He Leu Ser Met lie Gly Pro Ala Val Tyr Phe Lys Phe Ser 
115 120 125 
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Arg Leu Met Glu Gly^r^ Asp lie Pro Val Ala Glu Met Gl^^^e Thr 

130 135 140 

Phe Ala Thr Phe Val Ala Ala Ala Val Gly Thr Glu Phe Leu Ser Ala 
145 150 155 160 

Trp Val His Lys Glu Leu Trp His Glu Ser Leu Trp Tyr lie His Lys 
165 170 175 

Ser His His Arg Ser Arg Lys Gly Arg Phe Glu Phe Asn Asp Val Phe 
180 185 190 

Ala He He Asn Ala Leu Pro Ala He Ala Leu He Asn Tyr Gly Phe 
195 200 205 

Ser Asn Glu Gly Leu Leu Pro Gly Ala Cys Phe Gly Val Gly Leu Gly 
210 215 220 

Thr Thr Val Cys Gly Met Ala Tyr lie Phe Leu His Asn Gly Leu Ser 
225 230 235 240 

His Arg Arg Phe Pro Val Trp Leu He Ala Asn Val Pro Tyr Phe His 
245 250 255 

Lys Leu Ala Ala Ala His Gin He His His Ser Gly Lys Phe Gin Gly 
260 265 270 

Val Pro Phe Gly Leu Phe Leu Gly Pro Lys Glu Leu Glu Glu Val Arg 
275 280 285 

Gly Gly Thr Glu Glu Leu Glu Arg Val He Ser Arg Thr Thr Lys Arg 
290 295 300 

Thr Gin Pro Ser Thr 
305 



<210> 13 

<211> 310 

<212> PRT 

<213> Arabidopsis 

<400> 13 

Met Ala Ala Xaa Leu Ser Thr Ala Val Thr Phe Lys Pro Leu His Arg 
15 10 15 

Ser Phe Ser Ser Ser Ser Thr Asp Phe Arg Leu Arg Leu Pro Lys Ser 
20 25 30 

12 
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Leu Ser Gly Phe Ser Pro Ser Leu Arg Phe Lys Arg Phe Ser Val Cys 
35 40 45 

Tyr Val Val Glu Glu Arg Arg Gin Asn Ser Pro lie Glu Asn Asp Glu 
50 55 60 

Arg Pro Glu Ser Thr Ser Ser Thr Asn Ala lie Asp Ala Glu Tyr Leu 
65 70 75 80 

Ala Leu Arg Leu Ala Glu Lys Leu Glu Arg Lys Lys Ser Glu Arg Ser 
85 90 95 

Thr Tyr Leu lie Ala Ala Met Leu Ser Ser Phe Gly lie Thr Ser Met 
100 105 110 

Ala Val Met Ala Val Tyr Tyr Arg Phe Ser Trp Gin Met Glu Gly Gly 
115 120 125 

Glu lie Ser Met Leu Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly 
130 135 140 

Ala Ala Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Arg Ala Leu 
145 150 155 160 

Trp His Ala Ser Leu Trp Asn Met His Glu Ser His His Lys Pro Arg 
165 170 175 

Glu Gly Pro Phe Glu Leu Asn Asp Val Phe Ala lie Val Asn Ala Gly 
180 185 190 

Pro Ala He Gly Leu Leu Ser Tyr Gly Phe Phe Asn Lys Gly Leu Val 
195 200 205 

Pro Gly Leu Cys Phe Gly Ala Gly Leu Gly He Thr Val Phe Gly He 
210 215 220 

Ala Tyr Met Phe Val Kis Asp Gly Leu Val His Lys Arg Phe Pro Val 
225 230 235 240 

Gly Pro He Ala Asp Val Pro Tyr Leu Arg Lys Val Ala Ala Ala His 
245 250 255 

Gin Leu His His Thr Asp Lys Phe Asn Gly Val Pro Tyr Gly Leu Phe 
260 265 270 

Leu Gly Pro Lys Glu Leu Glu Glu Val Gly Gly Asn Glu Glu Leu Asp 
275 280 285 

13 
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Lys Glu lie Ser Arg Arg lie Lys Ser Tyr Lys Lys Ala Ser Gly Ser 
290 295 300 

Gly Ser Ser Ser Ser Ser 
305 310 



<210> 14 

<211> 305 

<212> PF.T 

<213> Arabidopsis 

<400v 14 

Met Ale Gly Leu Ser Thr lie Ala Val Thr Leu Lys Pro Leu Asn 

15 10 15 

Arg Ser Ser Phe Ser Ala Asn His Pro lie Ser Thr Ala Val Phe Pro 
20 25 30 

Pro Ser Leu Arg Phe Asn Gly Phe Arg Arg Arg Lys lie Leu Thr Val 
35 40 45 

Cys Phe Val Val Glu Glu Arg Lys Gin Ser Ser Pro Met Asp Asp Asp 
50 55 60 

Asn Lys Pro Glu Ser Thr Thr Ser Ser Ser Glu lie Leu Met Thr Ser 
65 "70 75 80 

Arg Leu Leu Lys Lys Ala Glu Lys Lys Lys Ser Glu Arg Phe Thr Tyr 

85 90 95 

Leu lie Ala Ala Val Met Ser Ser Phe Gly lie Thr Ser Met Ala lie 
100 105 110 

Met Ala Val Tyr Tyr Arg Phe Ser Trp Gin Met Lys Gly Gly Glu Val 
115 120 125 

Ser Val Leu Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly Ala Ala 
130 135 140 

Val Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Arg Ala Leu Trp 
145 150 155 160 

His Asp Ser Leu Trp Asn Met His Glu Ser His His Lys Pro Arg Glu 
165 170 175 

Gly Ala Phe Glu Leu Asn Asp Val Phe Ala lie Thr Asn Ala Val Pro 
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180 185 190 

Ala lie Gly Leu Leu Tyr Tyr Gly Phe Leu Asn Lys Gly Leu Val Pro 
195 200 205 



Gly Leu Cys Phe 
210 

Tyr Met Phe Val 

225 

Pro lie Ala Asn 



Leu His His Thr 
260 

Gly Pro Lys Gin 
275 

Lys Glu lie Ser 
290 

Ser 
305 



Gly Ala- Gly Leu 
215 

His Asp Gly Leu 
230 

Val Pro Tyr Leu 
245 

Asp Lys Phe Lys 



Glu Val Glu Glu 
280 

Arg Arg lie Lys 
295 



Gly lie Thr Met 
220 

Val His Lys Arg 
235 

Arg Lys Val Ala 
250 

Gly Val Pro Tyr 
265 

Val Gly Gly Lys 



Leu Tyr Asn Lys 
300 



Phe Gly Met Ala 



Phe Pro Val Gly 
240 

Ala Ala His Gin 
255 

Gly Leu Phe Leu 
270 

Glu Glu Leu Glu 
285 

Gly Ser Ser Thr 



<210> 15 

<211> 315 

<212> PRT 

<213> Capsicum annuum 



<400> 15 

Met Ala Ala Glu lie Ser lie Ser 
1 5 

Gin Arg Asn Pro Phe Pro Ala Pro 
20 

Leu Leu Phe Phe Ser Pro Leu Thr 

35 40 



Ala Ser Ser Arg Ala lie Cys Leu 
10 15 

Lys Tyr Phe Ala Thr Ala Pro Pro 
25 30 

Cys Asn Leu Asp Ala lie Leu Arg 
45 



Ser Arg Arg Lys Pro Arg Leu Ala Ala Cys Phe Val Leu Lys Asp Asp 

50 55 60 

Lys Leu Tyr Thr Ala Gin Ser Gly Lys Gin Ser Asp Thr Glu Ala lie 

65 70 75 80 
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Gly Asp Glu lie Glu^^ Glu Thr Asn Glu Glu Lys Ser Leu^la Val 

85 90 95 

Arg Leu Ala Glu Lys Phe Ala Arg Lys Lys Ser Glu Arg Phe Thr Tyr 
100 105 110 

Leu Val Ala Ala Val Met Ser Ser Leu Gly lie Thr Ser Met Ala Val 
115 120 125 

lie Ser Val Tyr Tyr Arg Phe Ser Trp Gin Met Glu Gly Gly Glu Met 
130 135 140 

Pro Phe Ser Glu Met Phe Cys Thr Phe Ala Leu Ala Phe Gly Ala Ala 
145 150 155 160 

lie Gly Met Glu Tyr Trp Ala Arg Trp Ala His Arg Ala Leu Trp His 
165 170 175 

Ala Ser Leu Trp His Met His Glu Ser His His Arg Pro Arg Glu Gly 
180 185 190 

Pro Phe Glu Leu Asn Asp lie Phe Ala lie lie Asn Ala Val Pro Ala 
195 200 205 

lie Ala Phe Phe Ser Phe Gly Phe Asn His Lys Gly Leu lie Pro Gly 
210 215 220 

lie Cys Phe Gly Ala Gly Leu Gly lie Thr Val Phe Gly Met Ala Tyr 
225 230 235 240 

Met Phe Val His Asp Gly Leu Val His Lys Arg Phe Pro Val Gly Pro 
245 250 255 

lie Ala Lys Val Pro Tyr Phe Gin Arg Val Ala Ala Ala His Gin Leu 
260 265 270 

His His Ser Asp Lys Phe Asp Gly Val Pro Tyr Gly Leu Phe Leu Gly 
275 280 285 

Pro Lys Glu Leu Glu Glu Val Gly Val lie Glu Glu Leu Glu Lys Glu 
290 295 300 

Val Asn Arg Arg lie Lys Ser Leu Lys Arg Leu 
305 310 315 



<210> 16 
<211> 316 
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<212> PRT 

<213> Capsicum annuum 
<400> 16 

Thr Thr Gly Arg Tyr His Tyr Gin Leu Val Trp Cys Gin lie Ser Phe 
1 5 10 15 

Ser Ser Thr Ser Arg Thr Ser Tyr Tyr Arg His Ser Pro Phe Leu Gly 
20 25 30 

Pro Lys Pro Thr Pro Thr Thr Pro Ser Val Tyr Pro lie Thr Pro Phe 
35 40 45 

Ser Pro Asn Leu Gly Ser lie Leu Arg Cys Arg Arg Arg Pro Ser Phe 
50 55 60 

Thr Val Cys Phe Val Leu Glu Asp Asp Lys Phe Lys Thr Gin Phe Glu 
65 70 75 80 

Ala Gly Glu Glu Asp lie Glu Met Lys lie Glu Glu Gin lie Ser Ala 
85 90 95 

Thr Arg Leu Ala Glu Lys Leu Ala Arg Lys Lys Ser Glu Arg Phe Thr 
100 105 110 

Tyr Leu Val Ala Ala Val Met Ser Ser Phe Gly lie Thr Ser Met Ala 
115 120 125 

Val Met Ala Val Tyr Tyr Arg Phe Tyr Trp Gin Met Glu Gly Gly Glu 
130 135 140 

Val Pro Phe Ser Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly Ala 
145 150 155 160 

Ala Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Lys Ala Leu Trp 
165 170 175 

His Ala Ser Leu Trp His Met His Glu Ser His His Lys Pro Arg Glu 
180 185 190 

Gly Pro Phe Glu Leu Asn Asp Val Phe Ala lie lie Asn Ala Val Pro 
195 200 205 

Ala lie Ala Leu Leu Asp Tyr Gly Phe Phe His Lys Gly Leu lie Pro 
210 215 220 

Gly Leu Cys Phe Gly Ala Gly Leu Gly lie Thr Val Phe Gly Met Ala 
225 230 235 240 

17 



BNSDOCID: <WO 9961652A1 J_> 



WO 99/61652 PCT/US99/1045S 



Tyr Met Phe Val His Asp Gly Leu Val His Lys Arg Phe Pro Val Gly 
245 250 255 

Pro Val Ala Asn Val Pro Tyr Leu Arg Lys Val Ala Ala Ala His Ser 
260 265 270 

Leu His His Ser Glu Lys Phe Asn Gly Val Pro Tyr Gly Leu Phe Leu 
275 280 285 

Gly Pro Lys Glu Leu Glu Glu Val Gly Gly Leu Glu Glu Leu Glu Lys 
290 295 300 

Glu Val Asn Arg Arg Thr Arg Tyr lie Lys Gly Ser 
305 310 315 



<210> 17 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 17 

cagaatcggt ctgttctatt agttcttcc 29 

<210> 18 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 18 

caatttgagg aatatcaagc ttccttgttc tc 32 
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